Graph-Based Visual-Semantic Entanglement Network for Zero-Shot Image Recognition

نویسندگان

چکیده

Zero-shot learning uses semantic attributes to connect the search space of unseen objects. In recent years, although deep convolutional network brings powerful visual modeling capabilities ZSL task, its features have severe pattern inertia and lack representation relationships, which leads bias ambiguity. response this, we propose Graph-based Visual-Semantic Entanglement Network conduct graph features, is mapped by using a knowledge graph, it contains several novel designs: 1. establishes multi-path entangled with neural (CNN) (GCN), input from CNN GCN model implicit relations, then feedback modeled information features; 2. attribute word vectors as target for GCN, forms self-consistent regression supervise learn more personalized relations; 3. fuses supplements hierarchical visual-semantic refined into embedding. Our method outperforms state-of-the-art approaches on multiple representative datasets: AwA2, CUB, SUN promoting linkage modelling features.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Semantic Graph for Zero-Shot Learning

Zero-shot learning aims to classify visual objects without any training data via knowledge transfer between seen and unseen classes. This is typically achieved by exploring a semantic embedding space where the seen and unseen classes can be related. Previous works differ in what embedding space is used and how different classes and a test image can be related. In this paper, we utilize the anno...

متن کامل

Zero-Shot Visual Recognition using Semantics-Preserving Adversarial Embedding Network

We propose a novel framework called SemanticsPreserving Adversarial Embedding Network (SP-AEN) for zero-shot visual recognition (ZSL), where test images and their classes are both unseen during training. SP-AEN aims to tackle the inherent problem — semantic loss — in the prevailing family of embedding-based ZSL, where some semantics would be discarded during training if they are nondiscriminati...

متن کامل

Alternative Semantic Representations for Zero-Shot Human Action Recognition

A proper semantic representation for encoding side information is key to the success of zero-shot learning. In this paper, we explore two alternative semantic representations especially for zero-shot human action recognition: textual descriptions of human actions and deep features extracted from still images relevant to human actions. Such side information are accessible on Web with little cost...

متن کامل

Gaussian Visual-Linguistic Embedding for Zero-Shot Recognition

An exciting outcome of research at the intersection of language and vision is that of zeroshot learning (ZSL). ZSL promises to scale visual recognition by borrowing distributed semantic models learned from linguistic corpora and turning them into visual recognition models. However the popular word-vector DSM embeddings are relatively impoverished in their expressivity as they model each word as...

متن کامل

Zero-Shot Learning on Semantic Class Prototype Graph.

Zero-Shot Learning (ZSL) for visual recognition is typically achieved by exploiting a semantic embedding space. In such a space, both seen and unseen class labels as well as image features can be embedded so that the similarity among them can be measured directly. In this work, we consider that the key to effective ZSL is to compute an optimal distance metric in the semantic embedding space. Ex...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Multimedia

سال: 2022

ISSN: ['1520-9210', '1941-0077']

DOI: https://doi.org/10.1109/tmm.2021.3082292